Guaranteeing Convergence of Iterative Skewed Voting Algorithms 

for Image Segmentation 



Dora C. Balcan^'*, Gowri Srinivasa'', Matthew Fickus'^, Jelena Kovacevic'' 

"School of Interactive Computing, Georgia Institute of Technology, Atlanta, USA 
''Dept. of Information Science and Engineering, and Center for Pattern Recognition, PES School of Engineering, Bangalore, India 
'^Dept. of Mathematics and Statistics, Air Force Institute of Technology, Wright-Patterson AFB, USA 
''Dept. of Biomedical Eng., Electrical and Computer Eng. and Center for Bioimage Informatics, Carnegie Mellon University, Pittsburgh, USA 



Abstract 

In this paper we provide rigorous proof for the convergence of an iterative voting -based image segmentation algorithm 
called Active Masks. Active Masks (AM) was proposed to solve the challenging task of delineating punctate patterns 
of cells from fluorescence microscope images. Each iteration of AM consists of a linear convolution composed with 
a nonlinear thresholding; what makes this process special in our case is the presence of additive terms whose role is 
to "skew" the voting when prior information is available. In real-world implementation, the AM algorithm always 
converges to a fixed point. We study the behavior of AM rigorously and present a proof of this convergence. The 
key idea is to formulate AM as a generalized (parallel) majority cellular automaton, adapting proof techniques from 
discrete dynamical systems. 
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1. Introduction 



Recently, a new algorithm called Active Masks (AM) was proposed for the segmentation of biological images ||14|] . 
Let the "image" / be any real-valued function over the domain Q := Y]d=i '^Nj and refer to the := N\N2- ■ -No 
elements of Q as pixels; here, 1,^,1 denotes the finite group of integers modulo A^^. A segmentation of / assigns one of 
M possible labels to each of the pixels in Q.. For the fluorescence microscope image depicted in Figure [Tl a), one 
example of a successful segmentation is to label all of the background pixels as "1," assign label "2" to every pixel in 
the largest cell, "3" to every pixel in the second largest cell, and so on. Formally, a segmentation is a label function 
if) : Q {1,2,..., M], or, equivalently, a collection of M binary masks : Q. —> {0, 1 ) where, at any given n e O, 
we have /im(«) = 1 if and only if tp{n) = m. That is, yu,„ at any iteration / can be defined as 



1, ^i{n)-m, 
0, iA((«) ^ ni. 



In AM, these masks actively evolve according to a given rale. To understand this evolution, it helps to first discuss 
iterative voting: in each iteration, at any given pixel, one counts how often a given label appears in the neighborhood 
of that pixel — weighting nearby neighbors more than distant ones — and assigns the most frequent label to that pixel 
in the next iteration. For example, if a pixel labeled "1" in the current iteration is completely surrounded by pixels 
labeled "2", its label will likely change to "2" in the next iteration. Formally speaking, iterative voting is the repeated 
application of the rule: 

Iterative Voting: lAXn) - argmax [(/^J^"'^ * g)in)\, (1) 

\<m<M 
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where / is the index of the iteration, ^ : Q — > M is some arbitrarily chosen fixed weighting function and "*" denotes 
circular convolution over Q.. Iterative voting is referred to as a convolution-threshold scheme since it simplifies to 
rounding the filtered version of /i*'' in the special case M - 2. Experimentation reveals that for typical low-pass filters 
g, repeatedly applying ([T]i to a given initial t//o results in a progressive smoothing of the contours between distinctly 
labeled regions of Q. Despite this nice property, note that taken by itself, iterative voting is useless as a segmentation 
scheme, as ([T) evolves masks in a manner that is independent of any image under consideration. 

The AM algorithm is a generalization of ([TJ that contains additional image-based terms whose purpose is to drive 
the iteration towards a meaningful segmentation. To be precise, the AM iteration is: 

Active Masks: </',■(«) = argmax * g)(n) + /?„,(«)], (2) 

l<m<M 

where the region-based distributing functions {Rm]ff,^i can be any image-dependent real-valued functions over O. 
These will be referred to as skew functions in this paper, due to their role to bias the voting. Essentially, at any given 
pixel n, these additional terms skew the voting towards labels m whose Rm(n) values are large. For good segmentation, 
one should define the R„,'s in terms of features in the image that distinguish regions of interest from each other 

For example, for the fluorescence microscope image given in Figurellla), the cells appear noticeably brighter than 
the background. As such, we choose Ri to be a soft-thresholded version of the image's local average brightness, and 
choose the remaining R„'s to be identically zero. When (|2]i is applied, such a choice in R^s forces pixels which lie 
outside the cells towards label "1," while pixels that lie inside a cell can assume any other label. Intuitively, repeated 
applications of (|2]i will cause the mask //j'^ to converge to an indicator function of the background, while each of 
the other masks {f^m}m=2 converges either to a smooth blob contained within the foreground or to the empty set. 
Experimentation reveals that the AM algorithm indeed often converges to a i/r which assigns a unique label to each 
cell provided the scale of the window g is chosen appropriately lil4ll : see Figure [T]for examples. 

The purpose of this paper is to provide a rigorous investigation of the convergence behavior of the AM algorithm. 
To be precise, we note that in a real-world implementation the AM algorithm occasionally fails to converge to a tfr 
which is biologically meaningful. However, even in these cases, the algorithm always seems to converge to something. 
Indeed, when i/tq and the /?,„'s are chosen at random, experimentation reveals that the repeated application of (|2]i always 
seems to eventually produce (/r,- such that if/i+i = i/r,-. At the same time, a simple example tempers one's expectations: 
taking Q. - Z4, M - 2, w - 6-1 -H i5o + i5i and Ri = R2 = 0, we see that AM will not always converge, as repeatedly 
applying (|2|i to tfro - 60 + 62 produces the endless 2-cycle ^0-1-^2 6\ + 6^ i-> 5o + ^2- In summary, even though 
random experimentation indicates that AM will almost certainly converge, there exist trivial examples which show 
that it will not always do so. The central question that this paper seeks to address is therefore: 

Under what conditions on g and {Rm]^^i will the AM algorithm always converge to a fixed point of ^7 

We show that when g is an even function, AM will either converge to a fixed point or will get stuck in a 2-cycle; 
no higher-order cycles are possible. We can further rule out 2-cycles whenever g is taken so that the convolutional 
operator / / * g is positive semidefinite. The following is a compilation of these results: 

Theorem 1.1. Given any O := Y\d=i "^n^, initial segmentation tj/Q : O — > {1, . . . , M] and any real-valued functions 
{^ml^^i over D., the Active Mask algorithm, namely the repeated application of will always converge to a fixed 
point of (|2|l provided the discrete Fourier transform of g is nonnegative and even. 

A preliminary version of the results in this paper appears in the conference proceeding • Though the specific AM 
algorithm was introduced in [T?], iterative lowpass filtering has long been a subject of interest in applied harmonic 
analysis, having deep connections to continuous-domain ideas such as diff'usion and the maximum principle j^. 
For instance, (|9|] gives an edge detection application of a discretized version of these ideas. Meanwhile, (jg] gives 
diffusion-inspired conditions under which lowpass filtering is guaranteed to produce a coarse version of a given image. 
One way to prove the convergence of iterative convolution-thresholding schemes is to show that lowpass filtering 
decreases the number of zero-crossings in a signal; such a condition is equivalent to a version of the maximum 



principle jl ill . More recently, the continuous-domain version of ([T) has been used to model the motion of interfaces 



between media lll2ill3ll ; in that setting, ([TJ is known to converge if M -2. Since the AM algorithm is iterative, many 



of the proof techniques we use here were adapted from majority cellular automata (MCA), a well-studied class of 
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(a) Original image (b) i=0, M=256 



scales 4 scales 16 scale= 32 




(c) Segmentation outcomes for various scales 



Figure 1: Active mask segmentation of punctate patterns of proteins 1 14]. (a) Original image (courtesy of Linstedt Lab |[l|) 
(b) Initialization using M=256 random masks. After one iteration of (|2), the background is separated from the foreground by 
the region-based skew function R]. (c) Segmentation results using various scales of the voting filter at iterations 2, 8, 14 and at 
convergence. The first row shows a cross section of three Gaussian filters (scale=4, 16 and 32 respectively). The second row shows 
the segmentation result after two iterations of the algorithm. When scale=4, we observe a large number of small regions in the 
foreground. This is contrasted by the fewer number of regions when scale=16 and scale=32. Subsequent rows represent the state 
of the system at various stages of evolution. The last row represents the convergence states. Note that the algorithm converges 
regardless of the scale chosen, but the segmentation is only biologically meaningful at the proper scale of 16; at scales=4, 32 the 
cells are oversegmented or undersegmented, respectively. 
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discrete dynamical systems. Indeed, theoretical guarantees on the convergence of a symmetric class of MCA have 
been known for several decades; see IlKI^l, and references therein. Such results were recently generalized to a quasi- 
symmetric class via the use of Lyapunov functionals Whereas much of traditional MCA theory focuses on the 
convergence of repeated applications of ([T}, our work differs due to the presence of the additive R,,, terms in (|2|i. 

The paper is organized as follows. In the next section, we use an MCA formulation of AM to prove our main 
convergence results. In Section 3, we then briefly discuss the generalization of our main results to a less elegant 
yet more realistic version of (|2) involving noncircular convolution. We conclude in Section 4 with some examples 
illustrating our main results, as well as some experimental results indicating the AM algorithm's rate of convergence. 

2. Active Masks as a Majority Cellular Automaton 

Cellular automata are self-evolving discrete dynamical systems |3]. They have been applied in various fields such 
as statistical physics, computational biology, and the social sciences. A tremendous amount of work in this area has 
focused on studying the convergence behavior of various types of automata. In this section, we formulate the AM 
algorithm (|2]i as an MCA in order to facilitate our understanding of its convergence behavior To be precise, we 
consider a generalization of Q in which the convolutional operator / i-» / * ^ may more broadly be taken to be any 
linear operator A from ((Q.) := {f\f : Q. M.] into itself: 

Mn) min(argmax[(A;/(r")(«) + RM]), 14^'' := { J] 1^;^; J^j ^ ^ (3) 

Here, the contribution of mask m in deciding the outcome at location n at iteration / is (AjUm" '')(«), and any ties are 
broken by choosing the smallest m corresponding to a maximal element. Note that given any initial segmentation i/'o, 
applying Q ad infinitum produces a sequence {lA/lJ^o- However, as there are only distinct possible configurations 
for i/r : Q — » { 1 , . . . , M], this sequence must eventually repeat itself. Indeed, taking minimal indices io and A" > such 
that i^i„+K - lAio' the deterministic nature of (|3) implies that 1/^,+^ = i/r,- for all / > io. The finite sequence ' is 

called a cycle of (O of length K. Note that {(A/lJ^o converges if and only if K - 1, which happens precisely when (/f,,, 
is a fixed point of Q. 

Thus, from this perspective, proving that ([3) always converges is equivalent to proving that K = I regardless of 
one's choice of tfro. The following result goes a long way towards this goal, showing that if A is self-adjoint, then for 
any iJ/q we have that the resulting K is necessarily 1 or 2. That is, if A is self-adjoint, then for any i/tq, the sequence 
will either converge in a finite number of iterations, or it will eventually come to a point where it forever 
oscillates between two distinct configurations i/^/^ and tf^k+i- 

Theorem 2.1. If A is self-adjoint, then for any the cycle length K of ^ is either 1 or 2. 

Proof. As we are not presently concerned with the rate of convergence of (|3]l, but rather the question of whether it 
does converge, we may assume without loss of generality that {lA/li^o akeady entered its cycle. That is, we reindex 
so that iAo is the beginning of the ^T-cycle, and heretofore regard all iteration indices as members of the cyclic group 
"Lk- We argue by contrapositive, assuming K > 2 and concluding that A is not self-adjoint. For any i - 1, . . . , K, ^ 
is equivalent to the system of inequalities: 

iAn'-^'J,)(n) + «^,(„)(n) > (A;u(;-")(«) + R„,(n) if 1 < m < (4a) 

(A;i|;-^|)(n) + /?^,(„)(«) > (A;u(;-")(«) + R,„(n) if ^n) <m<M. (4b) 

Here, (l4bT l follows from the fact that (A,(n) is a value of m that maximizes (Af/^^^)(n) + Rm{n). Moreover, in the event 
of a tie, i//iin) is chosen to be the least of all such maximizing m, yielding the strict inequality in (l4al l. For any / and n, 
picking m - i/',-2(n) in (l4al i and (l4b] i leads to the subsystem of inequalities: 

(At/^;)(n) - (A;u^-j|„))(n) + «^,(„)(«) - /?^,_,(„)(n) > if ^,-2(«) < i/^i(n), (5a) 

(A|i^-;;j)(«) - (A;u^-j|„))(«) + /;^,(„)(«) - /;^,_,(„)(«) > if ^^,_2(n) > if'iin). (5b) 
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Now since K > 2, there exists a pixel n for which [ijjoiri), t//i(n), . . ., tfrK-\(n)] is not of the form {a, a, ... , a] nor of 
the form {a, b, a,b, . . .a, b). At such an n, there must exist an / such that tf'i-iin) < if/iin). Consequently, at least one 
inequality in (|5]l is strict. Thus, summing (|5]l over all pixels n and all cycle indices / yields: 

/eZjc nen ieZ^ nen ieZ^ neCl (eZa- nen 

Since is shift-invariant, ^ R,i,i(n)(n) - ^ ^iAi-2(«)(") for n e D., reducing the previous equation to: 

< Z - Z - Z Z'^^w^i)^") - Z Z(^^® 

;6Za- nen ieZ^ nefl /eZjc nen /eZ^ nen 

where the final equality also follows from the shift-invariance of Z^. To continue, note that for any /, j e Zk we have 
f/£ = 1 if and only if (/')(«) = m and so: 



M M 



nen neQ m-1 m=\ 

where (/, g) \- ^ f(n)g(n) is the standard real inner product over Using (|7]i in Q gives: 



w^fi .* 
M M M 



< J] z<^^^"'^^> - Z Z<^'"^'^^"> = Z Z«^ ^®>' 

ieZx m=l /eZs; m=l ieZjr m=l 

implying A - A* 0, and so A is not self-adjoint. □ 

Theorem 12. II has strong implications for the AM algorithm (|2). Indeed, it is well-known that if g is real-valued, 
then the adjoint of the convolutional operator Af = / * ^ is A*/ - f * g where g{n) - g(-n) is the reversal of g. 
As such, if g is an even function. Theorem 12. 1 1 guarantees that AM will always either converge or enter a 2-cycle. 
We now build on the techniques of the previous proof to find additional restrictions on A which suffice to guarantee 
convergence: 

Theorem 2.2. If A is self-adjoint and {Af, f) > for all / : Q — > {0, + 1 }, then ^ always converges. 



Proof In light of Theorem 12. II our goal is to rule out cycles of length K = 2. We argue by contrapositive. That is, 
we assume that there exist two distinct configurations i^o and i/'i which are successors of each other, and will use this 
fact to produce / : Q -> {0, +1) such that (A/,/) < 0. Substituting / - and m - i//\(n) into (|4al) and (l4b] ) yields: 

(A;z^'^'(„j)(n) - (A/z^'_'(„j)(n) + -R^„(„)(n) - /;^,(«)(«) > if <Ai («) < «Ao(n), (8a) 

(A;z^'^'(„j)(n) - (A/z^'_'(„j)(n) + -R^„(„)(n) - /;^,(«)(«) ^0 if -Ai («) > M^)- (8b) 

Similarly, letting / - 1 and m - i//o(n) into ( l4at and ( l4bl i yields: 

(A^i'-^\„^)(n) - (A//^°'(„))(n) + «^,(„,(«) - /?^„(„)(«) > if Mn) < <Ai («), (9a) 

(A//^°'(„P(«) - (Anf^^„^)(n) + R^Min) - RMn)(n) > if Mn) > <Ai («)• (9b) 



Since (/ro and are distinct, there exists no 6 such that i/'o(no) i/'i(no)- If 'A{)(n{)) < i/'i (no), we sum (l8b] i and ( l9al i 
over all « e f2. If on the other hand i//o(no) > if/i (no), we sum dSal i and (l9b] i over all n € O. Either way, we obtain: 

N 

< Z[(^^W«))^«) - (^^i.?(n))(«) + (^<(n))(«) - 

«=I 

Applying (|7]l four times then gives: 

M M 

m=l m=l 

As such, there exists at least one index mo such that > {A(ju|,5|J - yUm„'), (jUm'o ~ /"mo)); choose / to be yuj,^ - /i'^i,'. □ 
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The most obvious way to ensure that (A/, /) > for all / : f2 ^ {0, +1 ) is for A to be positive semidefinite, that is, 
{Af, /) > for all / : f2 — > M. This in turn can be guaranteed by taking A to be diagonally dominant with nonnegative 
diagonal entries, via the Gershgorin circle Theorem [5]. Note that in fact strict diagonal dominance guarantees that 
iterative voting ([T]i always converges in one iteration. More interesting examples can be found in the special case 
where A is a convolutional operator Af - f * g- Indeed, letting F be the standard non-normalized discrete Fourier 
transform (DFT) over Q., we have: 

{Af, f) ^{f*gj)^ i<F(/ * g), F/> = ^<(F/)(Fg), F/) = i Yj{¥g){n)\(¥f){n)\\ (10) 

neil 

As such, if g : O M is even and i¥g)in) > for all n eD., then A is self-adjoint positive semidefinite. Moreover, it is 
well-known that g is real-valued and even if and only if ¥g is also real-valued and even. Thus, A is self-adjoint positive 
semidefinite provided ¥g is nonnegative and even. For such g, Theorem l2 .2 1 guarantees that the AM algorithm (|2) will 
always converge. These facts are summarized in Theorem ll.il which is stated in the introduction. Examples of g that 
satisfy these hypotheses are given in Section|4] 

We emphasize that Theorem l2.2l does not require A to be positive semidefinite, but rather only that (A/, /) > for 
all / : Q ^ {0, +1}. In the case of convolutional operators, this means we truly only need ( fTOt to hold for such /'s. 
As such, it may be overly harsh to require that {Fg){n) > for all n e D.. Unfortunately, the problem of characterizing 
such ^'s appears difficult, as we could find no useful frequency-domain characterizations of {0, +l}-valued functions. 
A spatial domain approach is more encouraging: when Q. = 1^^, writing f : Q. —> {0, ±1} as the difference of two 
characteristic functions;^/, ,xi2 ■ ^ {0, 1}, we have: 

<A/,/) = {A(xj, -XhJ'OCi, -Xh)) = sum(Ai,i) + sum(A2,2) - sum(Ai,2) - sum(A2,i), 

where sum(A, j) denotes the sum of all entries of the submatrix of A consisting of rows from /, and columns from Ij. 
As such, the condition of Theorem l2.2l reduces to showing that < sum(Ai i) + sum(A2,2) - sum(Ai_2) - sum(A2,i) for 
all choices of subsets /, and Ij of I^n- 

We conclude this section by noting that (O is similar to threshold cellular automata (TCA) flQl- In fact, (O is 
equivalent to TCA in the special case of M = 2; in this case, = 1 - //j'"'^(n) for all n e Q, implying: 

(A//(;-'^)(«) + R,{n) > {A^ll'\n) + R,,{n) ^ [A(fi^'-'^ - ^ll'^)]{n) + {R, - RqXh) > 

^ {a[m^I-'> - (1 - + (Ri - Ro)(n) > 

^ (A^*r")(n) + ^(Ri -Ro- Al)(n) > 

<^ {AiJ^'^^)(n) + b{n) > 0, 

where b(n) :- j(Ri - Rq - Al)(«). That is, when M - 2, the AM algorithm is equivalent to a threshold-like decision. 
But whereas the traditional method for proving the convergence of TCA involves associated quadratic Lyapunov 
functionals our method for proving the convergence of AM is more direct, being closer in spirit to that of lliol . 



3. Beyond symmetry 

Up to this point, we have focused on the convergence of ^ in the special case where A is self-adjoint. In this 
section, we discuss how Theorems 12.11 and 12.21 generalize to the case of quasi-self adjoint operators, which arise in 
real-world implementation of the AM algorithm. To clarify, up to this point, we have let the image / and weights g 
be functions over the finite abelian group Q = Y\d=i '^Ki have taken the convolutions in ^ and Q to be circular. 
In real-world implementation, the use of such circular convolutions can result in poor segmentation, as values at one 
edge of the image are used to influence the segmentation at the unrelated opposite edge. 

One solution to this problem — implemented in 1 14] — is to redefine the set of pixels as a subset Q := Ud^il^^^d) of 
the D-dimensional integer lattice Z^, and regard our image / as a member of ^(Q) := {/ : — » R | /(n) - Q'in i £1}. 
Here, the label function ip and masks fj.,„ are regarded as { 1 , . . . , M]- and {0, 1 [-valued members of {(SI), respectively, 
and the (noncommutative) convolution of any /, g E {(Q.) with g e fiilP) is defined as / ★ g e ^(Q), 

(/★g)(«):^ / %y Vnea (11) 

(xo. * gm) 
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where xn is the characteristic function of Q, and * denotes standard (noncircular) convolution in (^{iP). For the 
theory below, we need to place additional restrictions on g, namely that it belongs to the class: 

^(Q) := [g € ^2(Z«) : (xn * g){n) > V« € O). 

In this setting, for a given g € ^(Q), the AM algorithm (|2]l becomes: 

Noncircular Active Masks: = argmax [(^u^-" ★ ^)(n) + /?,„(«)], ^^-i> := i ^' '/'/-i(n) = (^2) 



\<m<M 



0, ipi-\{n) + m. 



Note that the use of the ★-convolution in ( fT2l i ensures that any "missing votes" are not counted in favor of any label 
m. Moreover, the denominator of dTTt ensures that when n is close to an edge of Q, the weights in the g-neighborhood 
of n are rescaled so as to always sum to one. This rescaUng ensures that 2m=i(y"l? * g)(n) = 1 for all n e Q, avoiding 
any need to modify the skew functions R,„ near the boundary. 

We then ask the question: for what g will (fT2l l always converge? The key to answering this question is to realize 
that the ★-filtering operation Af - f * g can be factored as A = DB, where B is the standard filtering operator 
Bf = f * g and {Df){n) - A„f{n), where A„ = [(xn * ^)(«)] Here, A, B and D are all regarded as linear operators 
from {(Q.) into itself. More generally, we inquire into the convergence of: 

^,(«) = argmax [(Af,t'^)(n) + R,„(n)l fj^'^ := ( ^' f '-'^"] Z !!' (13) 

l<m<M ( ^' Vz-K"; ^ "h 

where A - DB and D is positive-multiplicative, that is, (Df)(n) - A„ f(n) where A„ > for all n € Q. In particular, we 
follow [f\ in saying that A is quasi-self-adjoint if there exists a positive-multiplicative operator D and a self-adjoint 
operator B such that A = DB. This definition in hand, we have the following generalization of Theorems 12.11 and 12 . 21 

Theorem 3.1. Let A be quasi-self-adjoint: A — DB where D is positive-multiplicative and B is self-adjoint. Then for 
any ipo, the cycle length K of (I13l l is either 1 or 2. Moreover, if B is positive-semidefinite, then (I13l l always converges. 



Proof We only outline the proof, as it closely follows those of Theorems 12.11 and 12.21 Let (Df){n) - A„f(n) with 
A„ > for all n e Q. We prove the first conclusion by contrapositive, assuming K > 2. Rather than summing (|5]l over 
all n and / directly, we instead first divide each instance of (|5]l by the corresponding A„, and then sum. The resulting 
quantity is analogous to (|6]l: 

< Z Z T^^^^^ - Z Z ^^A'^ = Z Z(<«i)(«) - Z Z(^^^L(«))(«)- (14) 

Simplifying the right-hand side of ( fT4l l with O quickly reveals that B cannot be self-adjoint, completing this part of 
the proof. For the second conclusion, we again prove by contrapositive, assuming K = 2. Dividing dSab . dSbl l. ( |9a] l 
and ( |9b] l by A„ and then summing either dSat and (|9b] i over all n or (l8bl i and ( |9al i over all n gives: 

W , M 
«=1 m=l 

implying B is not positive semidefinite. □ 

For a result about the convergence of (IT2t . we apply Theorem 13 . 1 1 to A = where A„ = [(^n * ^)(n)] ' and 
B/ - f * g. Note that we must have ^ € in order to guarantee that D is positive. Moreover, B is self-adjoint 
if ^ € (^(iP) is even; since ^ is real-valued, this is equivalent to having its classical Fourier series g e L^(T^) be 
real-valued and even. Meanwhile, since: 

{Bf, f) = </ * g, f) = {fg, f) = r g{x)\f(x)f dx, 

Jt'' 

then B is positive semidefinite if g{x) > for almost every x € T^. To summarize, we have: 

Corollary 3.2. //f/ie Fourier series of g e ^(fi) is nonnegative and even, then (I12l l w/// aZwayi converge. 

In the next section, we discuss how to construct such windows g, along with other implementation-related issues. 
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4. Examples of Active Masks in practice 

In this section we present a few representative and interesting examples of filter-based cellular automata, and 
discuss their behavior in relation with the results we proved in the previous sections. We also present some preliminary 
experimental findings on the rate of convergence of AM. For ease of understanding, let us for the moment restrict 
ourselves to circulant iterative voting ([T), namely the version of AM (|2]l in which all the skew functions R,„ are 
identically zero. The simplest nonzero filter is g = Sq. The DFT of 60 has constant value 1, and is therefore nonnegative 
and even. As such. Theorem 11.11 guarantees that ([TJ will always converge. Of course, we already knew that: since 
f *(>o - f for all / e ((Q.), ([TJ will always converge in one step; as noted above, the same holds true for any g whose 
convolutional operator is strictly diagonally dominant with a nonnegative diagonal: g{Q) > Yjni=o 

More interesting examples arise from box filters: symmetric cubes of Dirac 6's. For instance, fix > 3 and 
consider ([T]i over Q. = where g - S-i + 60 + Si. Since g is symmetric. Theorem 12. 1 1 guarantees that dl} will 
either always converge or will enter a 2-cycle. However, if is even, then ([T]i will not always converge, since 

i/'o = 5o + ^2 H 1- Sn-2 generates a 2-cycle. This phenomenon is depicted in Figure|2ja). This simple example shows 

that symmetry alone does not suffice to guarantee convergence; one truly needs additional hypotheses on g, such as 
the requirement in Theorem 1 1.1 1 that its DFT is nonnegative. This hypothesis does not hold for g = S-i + 60 + 61, 
smce (Fg)(n) = 1 H-2cos(^). Similar issues arise in the two-dimensional setting Q = Za(, x Zat, : both the 3 x 3 
box filter (Moore's automaton, see Figure |2jb)) and the "plus" filter (von Neumann's automaton, see Figure |2jc)) 
are symmetric, meaning their cycle lengths are either 1 or 2, but neither are positive semidefinite, having DFTs of 
[1 + 2cos(^)][l -I- 2cos(^)] and 1 H- 2cos(^) -H 2cos(2^), respectively. Indeed, when A^i and N2 are even, 
alternating stripes generate a 2-cycle for the box filter, while the checkerboard generates a 2-cycle for the plus filter 

Of course, it is not difficult to find filters g which do satisfy the hypotheses of Theorem ll.il one may simply let 
g be the inverse DFT of any nonnegative even function. More concrete examples, such as a discrete Gaussian over 
Zff, can be found using the following process. Let /; : K — > M be an even Schwartz function whose Fourier transform 
is nonnegative; an example of such a function is a continuous Gaussian. Let g be the A^-periodization of the integer 
samples of h, namely g{n) :- 2U?=_co h{n + Nn'). Then g is even, and moreover, by the Poisson summation formula: 

A'— 1 CO 00 00 

(Fg)(«) = Yj = Z Z + = Yj '^(^)e"'^ = Z + - 

rt'=0 /7'=()n"=-oo k=-co k=-oo 

In particular, if g is chosen as a periodized version of the integer samples of any zero-mean Gaussian, then Theorem ll.il 
gives that the AM algorithm ^ necessarily converges. This construction method immediately generalizes to higher- 
dimensional settings where D > 1. It also generalizes to the noncircular convolution setting considered in Section [3] 
There, we further restrict h to be strictly positive, and let g be the integer samples of h. The positivity of h implies 
(Xn * g){n) > for all n e O, implying g e ^(O) as needed. Moreover, g is even and the Poisson summation formula 
gives that its Fourier series is nonnegative: g{x) = YjT=-oo ^(^ + x) > 0. Any g constructed in this manner satisfies the 
hypotheses of Corollary 13.21 implying the corresponding noncirculant AM (fT2l i necessarily converges. 

4.1. The rate of convergence of the AM algorithm 

Up to this point, we have focused on the question of whether or not the AM algorithm ^ converges. Having 
settled that question to some degree, our focus now turns to another question of primary importance in real-world 
implementation: at what rate does AM converge? Experimentation reveals that this rate highly depends on the con- 
figuration of the boundary between two distinctly labeled regions of Q.. This led us to postulate that the number 
of boundary crossings (see Figure O should monotonically decrease with each iteration. Experimentation reveals 
that this number indeed often decreases extremely rapidly, regardless of the scale of g. Figure |4] depicts such an 
experiment for the fluorescence microscope image shown in Figure [TJ a). Starting from a random initial configura- 
tion of 64 masks, we used a Gaussian filter under three different scales, with each plot depicting the evolution of 5 
independently-initialized runs of the algorithm. We emphasize the algorithm's fast rate of convergence: the vertical 
axis represents a nested four-fold application of the natural logarithm to the number of boundary crossings. We leave 
a more rigorous investigation of the AM algorithm's rate of convergence for future work. 
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(a) 3-tap filter automaton 
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(b) Moore's automaton 
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(c) von Neumann's automaton 



Figure 2: An illustration of oscillating states produced by various automata: (a) The 3-tap box filter (5_i + 60 + 6\ over Q = Z4. 
Using this g in Q with ifiQ = 60 + 62 results in the endless 2-cycle do + 62 1-^ Si + S3 1-^ 60 + 62- This is because at each iteration, 
each pixel's two neighbors will outvote him in deciding his label in the next iteration, (b) Convergence is also an issue in two 
dimensions, as illustrated by Moore's automaton — a 3x3 box filter — over fl = Z4 x Z4. (c) Two-cycles persist in two dimensions 
even when the box filter is replaced by the smoother "plus" filter of von Neumann's automaton. In all three cases, these filters are 
even and so Theorem l2. 1 [ ensures that the cycle length A" of l[Tll is either 1 or 2. However, none of them are positive semidefinite, 
as their DFTs attain negative values. As such, the convergence guarantees of Theorem l2.2l do not hold. 




Figure 3: An illustration of the zero-crossings in an image with M=3 masks. 
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Figure 4: The rate of decrease of the AM algorithm in terms of the number of boundary crossings. 



Acknowledgments 

We thank Prof. Adam D. Linstedt and Dr. Yusong Guo for providing the biological images which were the original 
inspiration for the AM algorithm and this work. Fickus and Kovacevic were jointly supported by NSF CCF 1017278. 
Fickus received additional support from NSF DMS 1042701 and AFOSR F1ATA00183G003, F1ATA00083G004 and 
Fl ATA0035J001 . Kovacevic also received support from NIH R03-EB008870. The views expressed in this article are 
those of the authors and do not reflect the official policy or position of the United States Air Force, Department of 
Defense, or the U.S. Govemment. 

References 

[1] Linstedt Lab at CMU. http://www.cmu.edu/bio/faculty/lmstedt.html. 

[2] D.C. Balcan, G. Srinivasa, M. Fickus, and J. Kova£evi£. Convergence behavior of the Active Mask segmentation algorithm. In IEEE ICASSP, 
2010. 

[3] E. Golds and S. Martinez. Neural and Automata Networks. Dynamical Behavior and Applications. Kluwer, 1990. 

[4] E. Goles-Chacc, Fr. Fogelman-Soulie, and D. Pellegrin. Decreasing energy functions as a tool for studying threshold networks. Discrete 

Applied Mathematics, 12:261-277, 1985. 
[5] G. Golub and Ch. L. Van Loan. Matrix Computations. Johns Hopkins University Press, London, 3rd edition, 1996. 
[6] J.J. Koenderink. The structure of images. Journ. Bio. Cyhern., 50(5):363-370, 1984. 

[7] P. Melatagia Yonta and R. Ndoundam. Opinion dynamics using majority functions. Mathematical Social Sciences, 57:223-244, 2009. 
[8] L. Nirenbarg. A strong maximum principle for parabolic equations. Commun. Pure and Appl. Math., 6:167-177, 1953. 
[9] P. Perona and J. Malik. Scale-space and edge detection using anisotropic dilfusion. IEEE Trans. Patt. Anal, and Mach. Intelligence, 12(7):629— 
639, 1990. 

[10] S. Poljak and M. Sura. On periodical behaviour in societies with symmetric influences. Combinatorica, 3(1):1 19-121, 1983. 

[11] R. A. Hummel. Readings in computer vision: issues, problems, principles, and paradigms, chapter Representations based on zero-crossings 

in scale-space, pages 753-758. Morgan Kaufmann Readings Series archive. Morgan Kaufmann PubUshers, Inc., CA, USA, 1987. 
[12] S. J. Ruuth. Efficient algorithms for diffusion-generated motion by mean curvature. Journ. Comp. Phys., 144:603-625, 1998. 
[13] S. J. Ruuth and B. Merriman. Convolution-thresholding methods for interface motion. Journ. Comp. Phys., 169:678-707, 2001. 
[14] G. Srinivasa, M. Fickus, Y. Guo, A. Linstedt, and J. KovaCevid. Active mask segmentation of fluorescence microscope images. IEEE Trans. 

Imag. Proc, 18(8):1817-1829, 2009. 



10 



